DualPipe & Cross-Node All-to-All Communication
FP8 Training
Multi-Token Prediction & Inference (Prefilling & Decoding)
Reinforcement Learning on the Base Model
Reinforcement Learning with Cold Start
3FS